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10 20 30 40 50 60 70 

GGTACCCATC TAATACATTA ATAACAAGAG AGAGAATGGA TAATGCAATT ATTTATTTTT ATGGGAGGCT 
CCATGGGTAG ATTATGTAAT TATTGTTCTC TCTCTTACCT ATTACGTTAA TAAATAAAAA TACCCTCCGA 



80 90 100 110 120 130 140 

ATATTTTTAT CGGATTTTAG TAAATAACGG GGCAATTCGG TACTTAGGTA AAGCTACGTA TGACTATCGC 

TATAAAAATA GCCTAAAATC ATTTATTGCC CCGTTAAGCC ATGAATCCAT TTCGATGCAT ACTGATAGCG 

150 160 170 180 190 200 210 

TACCGCTACG GTAGTTGAAT TGGAATTCTT CGATAGCATC TGTTGTGTTG TTGCAGTTAG GGTACTTGAA 

ATGGCGATGC CATCAACTTA ACCTTAAGAA GCTATCGTAG ACAACACAAC AACGTCAATC CCATGAACTT 



220 230 240 250 260 270 280 

TAGCTCCAGC CGTGAAAACG AGGGGTTTTC GCAGGTTTTA TAGGATTGCC AAGTTAGACT AGGGCAATTC 
ATCGAGGTCG GCACTTTTGC TCCCCAAAAG CGTCCAAAAT ATCCTAACGG TTCAATCTGA TCCCGTTAAG 

290 300 310 320 330 340 350 

ATGTTCACGG TATTGTGTAG TATATGAAAA AGGAGATCTC CCAAACAATT TATAATTTTG TATAAGGGAG 
TACAAGTGCC ATAACACATC ATATACTTTT TCCTCTAGAG GGTTTGTTAA ATATTAAAAC ATATTCCCTC 



>AT-rich_regiorL_ 
I 

360 370 380 390 400 410 420 

AAATCGAACT TGAGGTGTCT AATTCACCAA CCGAGCTACT CCCTCCGTTT CATATATGTA TATACATATA 
TTTAGCTTGA ACTCCACAGA TTAAGTGGTT GGCTCGATGA GGGAGGCAAA GTATATACAT ATATGTATAT 



430 440 450 460 470 480 490 

TACGTATATA TACGTATATA CACATATACG TATATACATA TATGGTATAT ACATATATAT ATATATATAT 
ATGCATATAT ATGCATATAT GTGTATATGC ATATATGTAT ATACCATATA TGTATATATA TATATATATA 



500 510 520 530 540 550 560 

ATATATATAT ATGTGTGTGT GTGTATGTGG GGTGGCAATG CTAAAAAGTT TTATAATATG AACGGATGAA 

TATATATATA TACACACACA CACATACACC CCACCGTTAC GATTTTTCAA AATATTATAC TTGCCTACTT 

570 580 590 600 610 620 630 

GTACTATCCA CTAAGTCCCT ATAGTTTTCT GGCACTGTGT AGTATACGAA TGCACAATTA TATCCATAAA 

CATGATAGGT GATTCAGGGA TATCAAAAGA CCGTGACACA TCATATGCTT ACGTGTTAAT ATAGGTATTT 



640 650 660 570 680 690 700 

ATTGATATTA TATATTCGTC GCGACGAAAA TAAAGACATA ATATTCGGTA TACCATTTAT CCACGATATA 
TAACTATAAT ATATAAGCAG CGCTGCTTTT ATTTCTGTAT TATAAGCCAT ATGGTAAATA GGTGCTATAT 



710 720 730 740 750 760 770 

TCTAAATTCC ACTGATATAT CTAAATTCCA CTTGATCCCT TTTATGGATA AATTCTGGAT AACAATTACT 
AGATTTAAGG TGACTATATA GATTTAAGGT GAACTAGGGA AAATACCTAT TTAAGACCTA TTGTTAATGA 



780 790 800 810 820 830 840 

ACCAGCAGTA TATCCTACTA TCAGCGCACT GCACACCAAA CTACCCTCAC CCAGTAGTTA CAAACGCATA 
TGGTCGTCAT ATAGGATGAT AGTCGCGTGA CGTGTGGTTT GATGGGAGTG GGTCATCAAT GTTTGCGTAT 
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850 860 870 880 890 900 910 

TTTTGCCGTT AGTTAATTAT TATCCGGTAA AGAAGGTAAA GAAGATTGGT AGTAATCCAA AATTTTCCCA 
AAAACGGCAA TCAATTAATA ATAGGCCATT TCTTCCATTT CTTCTAACCA TCATTAGGTT TTAAAAGGGT 

920 930 940 950 960 970 980 

ACCCCAACCT CGGAACAAAA ACCGCGTAGT ATTTGTCGTA ACCAGGAGCA TCCGAGTCAT TAATTTACAC 
TGGGGTTGGA GCCTTGTTTT TGGCGCATCA TAAACAGCAT TGGTCCTCGT AGGCTCAGTA ATTAAATGTG 



>Transcription_start_site 

>CAAG_site 

I I 
990 1000 1010 1020 1030 1040 1050 

CCAAACACAA AAAATTAGCA GCACGCAGCC GCCTTCCCAA TCCTCTCCTC TCTCCTCTCC TCTTCTCCAA 

GGTTTGTGTT TTTTAATCGT CGTGCGTCGG CGGAAGGGTT AGGAGAGGAG AGAGGAGAGG AGAAGAGGTT 



1060 1070 1080 1090 1100 1110 1120 

GCGGCAATTC GCGCGAGGTT TTCTCCGATC AAACCCTCGA ATCCCCCCCT CGCGAATCCA TCGGAGGGTA 
CGCCGTTAAG CGCGCTCCAA AAGAGGCTAG TTTGGGAGCT TAGGGGGGGA GCGCTTAGGT AGCCTCCCAT 

1130 1140 1150 1160 1170 1180 

GCCCCGCGAT CCGCGTCGGC GAGAGCGGAT TCCGATTCCG CG ATG GAG CGG GTG TTC TCC GTG 
CGGGGCGCTA GGCGCAGCCG CTCTCGCCTA AGGCTAAGGC GC TAG CTC GCC CAC AAG AGG CAC 

M E R V F S V> 
a a EXONl a a > 

1190 1200 1210 1220 1230 1240 
GAG GAG ATC TCC GAC CCA TTC TGG GTC CCG CCT CCG CCG CCG CAG TCG GCG GCG GCG 
CTC CTC TAG AGG CTG GGT AAG ACC CAG GGC GGA GGC GGC GGC GTC AGC CGC CGC CGC 
EEISDPFWVPPPPPQSAAA> 
a a a a a a a a EXONl a a a a a a a a > 

1250 1250 1270 1280 1290 
GCC CAG CAG CAG GGC GGC GGC GGC GTG GCT TCG GGA GGT GGT GGT GGT GTA GCG GGG 
CGG GTC GTC GTC CCG CCG CCG CCG CAC CGA AGC CCT CCA CCA CCA CCA CAT CGC CCC 
AQQQGGGGVASGGGGGVAG> 
a a a a a a a a EXONl a a a a a a a a > 

1300 1310 1320 1330 1340 1350 
GGC GGC GGC GGC GGG AAC GCG ATG AAC CGG TGC CCG TCG GAG TGG TAG TTC CAG AAG 
CCG CCG CCG CCG CCC TTG CGC TAG TTG GCC ACG GGC AGC CTC ACC ATG AAG GTC TTC 
GGGGGNAMNRCPSEWYFQK> 
a a a a a a a a EXONl a a a a a a a a > 



1360 1370 1380 

TTT CTG GAG GAG GCG GTG CTC GAT AGC CCC 
AAA GAC CTC CTC CGC CAC GAG CTA TCG GGG 

FLEEAVLDSP 
a a a a a a a a EXONl_ 



1390 1400 1410 
GTC CCG AAC CCT AGC CCG AGG GCC GAA 
CAG GGC TTG GGA TCG GGC TCC CGG CTT 
VPNPSPRAE> 
a a a a a a a a : 
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1420 1430 1440 1450 1460 
GCG GGA GGG ATC AGG GGC GCA GGA GGG GTG GTG CCG GTC GAT GTT AAG GAG CCG GAG 
CGC OCT CCG TAG TCC CCG COT CCT CCC CAC CAC GGC CAG CTA CAA TTC GTC GGC GTC 
AGGIRGAGGVVPVDVKQPQ> 
a a a a a a a a EXONl a a a a a a a a > 

1470 1480 1490 1500 1510 

1520 

GTC TCG GCG GCG GCG ACG ACG AGC GCG GTG GTG GAC CCC GTG GAG TAG AAC GCG ATG 
GAG AGC CGC CGC CGC TGC TGC TCG CGC CAC CAC CTG GGG CAC CTC ATG TTG CGC TAG 
LSAAATTSAVVDPVEYNAM> 
a a a a a a a a EXONl a a a a a a a a .> 

1530 1540 1550 1560 1570 1580 

CTG AAG CAG AAG CTG GAG AAG GAC CTC GCC GCG GTC GCC ATG TGG AGG GTACAGC 
GAC TTC GTC TTC GAC CTC TTC CTG GAG CGG CGC CAG CGG TAC ACC TCC CATGTCG 



1590 1600 1610 1620 1630 1640 1650 

CATTCTCCCC CCCTCTAGTA CTCGAGAGCT TACTGAGATC GGCAATGCTA GCTACTGTTT GCATCGAATG 
GTAAGAGGGG GGGAGATCAT GAGCTCTCGA ATGACTCTAG CCGTTACGAT CGATGACAAA CGTAGCTTAC 

1660 1670 1680 1690 1700 1710 1720 

TTTATAGGTA TTTAGATCGG GCATTTCTAT AGACCAATGG CGTCCATGGT CTTGCAATGC GCTCTGTTGA 
AAATATCCAT AAATCTAGCC CGTAAAGATA TCTGGTTACC GCAGGTACCA GAACGTTACG CGAGACAACT 

1730 1740 1750 1760 1770 1780 1790 

GTGTCGGTGG TTGGTTCGAC TCATAGTATG TAGGGTTGTG CGTATGTACA AACGGAAGCT TCATAGACCT 
CACAGCCACC AACCAAGCTG AGTATCATAC ATCCCAACAC GCATACATGT TTGCCTTCGA AGTATCTGGA 

1800 1810 1820 1830 1840 1850 1860 

CGGTATTGAG ATTGCGATAT CGATGCAACC TGCGAATTGG CGATGTAATC AGTCATATTC TTACTAAACT 
GCCATAACTC TAACGCTATA GCTACGTTGG ACGCTTAACC GCTACATTAG TCAGTATAAG AATGATTTGA 

1870 1880 1890 1900 1910 1920 1930 

GCGAGACAGT GGTTTGTTTG CAATTGCAAT ATTTTTGTAT GGGGCTGCTT AAACTGTCAT TGCCTTTTTA 
CGCTCTGTCA CCAAACAAAC GTTAACGTTA TAAAAACATA CCCCGACGAA TTTGACAGTA ACGGAAAAAT 



1940 1950 1960 1970 1980 1990 2000 

GATTGGCAAT ATGTGACTTT ATGCAAGTAT TTGATTGGGC GGATCCAGGA ACAAAAAGTT GGGGGGATTC 
CTAACCGTTA TACACTGAAA TACGTTCATA AACTAACCCG CCTAGGTCCT TGTTTTTCAA CCCCCCTAAG 

2010 2020 2030 2040 2050 2060 2070 

AACATACCGA GTACACTGGC ATAAACACAT CATCTCAGTA TTAAACTATG CTAAAATGCT ATTAAGAGAC 
TTGTATGGCT CATGTGACCG TATTTGTGTA GTAGAGTCAT AATTTGATAC GATTTTACGA TAATTCTCTG 
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2080 2090 2100 2110 2120 2130 2140 

CTTTAGCACC TCTTATCTTA TCAACCATGG TGAAAAAATT GAAGGGGGGA CTCAGGGGGG TATCCATGGG 
GAAATCGTGG AGAATAGAAT AGTTGGTACC ACTTTTTTAA CTTCCCCCCT GAGTCCCCCC ATAGGTACCC 

2150 2160 2170 2180 2190 2200 2210 

TCCGATGGGT GCAGGGGGGA CTGAGTCCCC CCTGCACCCA CGTTGAATCC GCCCTGGCAT GCGTATAAGC 
AGGCTACCCA CGTCCCCCCT GACTCAGGGG GGACGTGGGT GCAACTTAGG CGGGACCGTA CGCATATTCG 

2220 2230 2240 2250 2260 2270 2280 

TGTCACAGCC ATTTCTAGGT GCTTGTGCTT AGTTGGGTGA TGTCAGCTTA ATTTGTCTTT TCTATGTCGT 
ACAGTGTCGG TAAAGATCCA CGAACACGAA TCAACCCACT ACAGTCGAAT TAAACAGAAA AGATACAGCA 

2290 2300 2310 2320 2330 2340 2350 

CATCGATTTT CT/^GAAACG AAAAATAGCC TATTTATGTG CTCCAGAATT TGATGATCCC TGGCCCTTCA 
GTAGCTAAAA GATTCTTTGC TTTTTATCGG ATAAATACAC GAGGTCTTAA ACTACTAGGG ACCGGGAAGT 

2360 2370 2380 2390 2400 2410 2420 

TTTGCTGAAA TTAGCCTATT TGTTGGTTGC CCTTCAGTTT TTTCCCAGCT TATGTTGTTG CAATGTGTGG 
AAACGACTTT AATCGGATAA ACAACCAACG GGAAGTCAAA AAAGGGTCGA ATACAACAAC GTTACACACC 

2430 2440 2450 2460 2470 2480 2490 

CTATGCCTCG TTTTGTGCCC TATAATTTAT TATTTGCAAT TCATTTTTGT ACATGACTTA AAATGACACT 
GATACGGAGC AAAACACGGG ATATT7VAATA ATAAACGTTA AGTAAAAACA TGTACTGAAT TTTACTGTGA 

2500 2510 2520 2530 2540 2550 2560 

AGAGCAACAT GCACTGATTG GTTATCCTAT AATCATTTAT GTAGTTCTGT TCATTTTATC ATGCTAGCTC 
TCTCGTTGTA CGTGACTAAC CAATAGGATA TTAGTAAATA CATCAAGACA AGTAAAATAG TACGATCGAG 

2570 2580 2590 2600 2610 2620 

ATGTCATTTT CATCTTCAG GCC TOT GGC ACA GTT CCA CCT GAG CGT CCT GGA GCT GGT TCA 
TACAGTAAAA GTAGAAGTC CGG AGA COG TGT CAA GGT GGA CTC GCA GGA CCT CGA CCA AGT 
ASGTVPPERPGAGS> 
b b b b b EXON2 b b b b b b > 



2630 2640 2650 2660 2670 2680 

TCC TTG CTG AAT GCA GAT GTT TCA CAC ATA GGC GCT CCT AAT TCC ATC GGA GGTACTTA 
AGG AAC GAC TTA CGT CTA CAA AGT GTG TAT CCG CGA GGA TTA AGG TAG CCT CCATGAAT 

SLLNADVSHIGAPNSIG> 
b b b b b b b EX0N2 b b ^b b b b b > 



2690 2700 2710 2720 2730 2740 2750 

TCTTATCTGG TTACATTTTC AGATTGTTAT GAAACTACCC AAATATCCTG CACAATTGCA TGGGATTAAA 
AGAATAGACC AATGTAAAAG TCTAACAATA CTTTGATGGG TTTATAGGAC GTGTTAACGT ACCCTAATTT 
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2760 2770 2780 2790 2800 2810 2820 

TTTTAGTTTC TTTGAAATAG AAGTAGAGTT GTATTGCTGT CACGTCATCA AATAGTTCTG AAGCTATGAA 
AAAATCAAAG AAACTTTATC TTCATCTCAA CAT7VACGACA GTGCAGTAGT TTATCAAGAC TTCGATACTT 

2830 2840 2850 2860 2870 2880 2890 

TAAATAAGTT CCGCATTTGT TAGTGATTCT TTGAACATTA GAATTGTTAT GCTTAAGTAG ATAGGGTTAT 
ATTTATTCAA GGCGTAAACA ATCACTAAGA AACTTGTAAT CTTAACAATA CGAATTCATC TATCCCAATA 

2900 2910 2920 2930 2940 2950 2960 

GTTTGTTTGG AGTTCCCTTA AATCATTTCA TTGCTGACTG CCAGCTGGCA GGAGCATTTG TTGTTGCCTT 
CAAACAAACC TCAAGGGAAT TTAGTAAAGT AACGACTGAC GGTCGACCGT CCTCGTAAAC AACAACGGAA 

2970 2980 2990 3000 3010 3020 3030 

GACCATGAAT GAAGACCTTC CTGTTCTGAG TGCTCACAAG AAAACATATT TTGATTAATG CACCTTGAAT 
CTGGTACTTA CTTCTGGAAG GACAAGACTC ACGAGTGTTC TTTTGTATAA AACTAATTAC GTGGAACTTA 

3040 3050 3060 3070 3080 3090 3100 

CCTTAGGATC TTGCAAAGAT GGGCACTTAG CTTTAGAATT GAGTAGTACT TAAATAGCTG TTGTTATCAT 
GGAATCCTAG AACGTTTCTA CCCGTGAATC GAAATCTTAA CTCATCATGA ATTTATCGAC AACAATAGTA 

3110 3120 3130 3140 3150 3160 3170 

GATTTGTCCT GTAGTGAAAT GTCGACAAAA CAGGAATGCT ACTTTTGACT TCTGATATTT CATGCCTGGC 
CTAAACAGGA CATCACTTTA CAGCTGTTTT GTCCTTACGA TGAAAACTGA AGACTATAAA GTACGGACCG 

3180 3190 3200 3210 3220 3230 

TTTACTTATG CTCTGTTTGG AACATGGGCA CATATCA GGC AAT GCT ACT CCA GTT CAA AAC ATG 
AAATGAATAC GAGACAAACC TTGTACCCGT GTATAGT CCG TTA CGA TGA GGT CAA GTT TTG TAG 

GNATPVQNM> 
c c c EXON3 c c c > 



3240 3250 3260 3270 3280 3290 
CTA AGT GGC CCA AGT GGG GGA TCG GGC TCA CAG TTG GTA CAG AAT GTT GAT GTC CTT 
GAT TCA CCG GGT TCA CCC CCT AGC CCG AGT GTC AAC CAT GTC TTA CAA CTA CAG GAA 
LSGPSGGSGSQLVQNVDVL> 
_c c c c c c c c EX0N3 c c c c c c c c : 

3300 3310 3320 3330 3340 
GTA AAG CAG CCC ACC AGC TCT TCA TCA AGG GAG CAG TCA GAT GAT GAT GAC ATG AAG 
CAT TTC GTC GGG TGG TCG AGA AGT AGT TCC CTC GTC AGT CTA CTA CTA CTG TAG TTC 
VKQPTSSSSREQSDDDDMK> 
c c c c c c c c EXON3 c c c c_ c c c c : 

3350 3360 3370 3380 3390 3400 
GGA GAA GCT GAG ACC ACT GGA ACT GCA AGA CCT GCT GAT CAA AGA TTA CAA CGA 
CCT CTT CGA CTC TGG TGA CCT TGA CGT TCT GGA CGA CTA GTT TCT AAT GTT GCT 
GEAETTGTARPADQRLQR> 
c c c c c c c EX0N3 _c c c c c c c c > 
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3410 3420 3430 3440 3450 3460 3470 

AGGTGATC ATTCATTGCT TCCTTGTAAT ATAGATTCTG TACATAATTA ACCTACCTCG TCATGCATGC 
TCCACTAG TAAGTAACGA AGGAACATTA TATCTAAGAC ATGTATTAAT TGGATGGAGC AGTACGTACG 

3480 3490 3500 3510 3520 3530 3540 

ATGTGTCCTA TTTTCACCTT AGCCCTTTCA GTTGGATTTC CACTTTCATC CGGTAGCCTT TCAGTTTCCT 
TACACAGGAT AAAAGTGGAA TCGGGAAAGT CAACCTAAAG GTGAAAGTAG GCCATCGGAA AGTCAAAGGA 

3550 3560 3570 3580 3590 3600 3610 

ATTGCATCGC ATATATGATC TTTTACCTAC CATATTAGTT CTCTGTGTGC CATACTCAGT GCTTAGTGTC 
TAACGTAGCG TATATACTAG AAAATGGATG GTATAATCAA GAGACACACG GTATGAGTCA CGAATCACAG 

3620 3630 3640 3650 3660 3670 3680 

TCGAGCAAGA GAGGAATTTG TATGGCTATT ACACGTAGCA CTTTGCTCTC TACTTGTTTA TTGACATAAG 
AGCTCGTTCT CTCCTTAAAC ATACCGATAA TGTGCATCGT GAAACGAGAG ATGAACAAAT AACTGTATTC 

3690 3700 3710 3720 3730 3740 3750 

CAATTTGGGA TGAATTAAAT CTGAGTTCAC ATCATATTCC TTATGTCACA AGTTTCTGAA ACCGATTGTA 
GTTAAACCCT ACTTAATTTA GACTCAAGTG TAGTATAAGG AATACAGTGT TCAAAGACTT TGGCTAACAT 

3760 3770 3780 3790 3800 3810 3820 

TCTAGTATCT GGTTGATGCA CCCCCATCTT GGATTTGCAA ATCAAAGTTA TACTCCCTAG AGAGCTTTAC 
AGATCATAGA CCAACTACGT GGGGGTAGAA CCTAAACGTT TAGTTTCAAT ATGAGGGATC TCTCGAAATG 

3830 3840 3850 3860 3870 3880 3890 

CTTTCATAAA GCAATTACCC CAATAAACCA CGGATTTGAT AGCTATTGAC TATGATTACC AGAATTCATT 
GAAAGTATTT CGTTAATGGG GTTATTTGGT GCCTAAACTA TCGATAACTG ATACTAATGG TCTTAAGTAA 

3900 3910 3920 3930 3940 3950 3960 

TGGCAGCTAT TTTCTCAATT T/y^GTTTGGT ATTAGTCTCA GTTGGCTGTA AAATAATGTC ACGGTAGGGT 
ACCGTCGATA AAAGAGTTAA ATTCAAACCA TAATCAGAGT CAACCGACAT TTTATTACAG TGCCATCCCA 

3970 3980 3990 4000 4010 4020 4030 

ACATGTATGT GCAGCATACA AGGTATGGGT GAGTTATGAT ATGGACAGTG TGTACACCCC ACATTTGCTC 
TGTACATACA CGTCGTATGT TCCATACCCA CTCAATACTA TACCTGTCAC ACATGTGGGG TGTAAACGAG 

4040 4050 4060 4070 4080 4090 4100 

ACTAAAATCA AAATATTCAA ACGTCACGTG ATGATATGGT GGATTGCATT ATACCTTGTA TTGTTTATTA 
TGATTTTAGT TTTATAAGTT TGCAGTGCAC TACTATACCA CCTAACGTAA TATGGAACAT AACAAATAAT 

4110 4120 4130 4140 4150 4160 4170 

TGTTACTTGT GCTAGACAAT AATATAGGCT GTTCTTTTGG GTGATTTTGT ATGAAGATGT TGAGCAAGCA 
ACAATGAACA CGATCTGTTA TTATATCCGA CAAGAAAACC CACTAAAACA TACTTCTACA ACTCGTTCGT 

4180 4190 4200 4210 4220 4230 

CTTCTCGATA TAATGCTAGT TTTGTTGACC TGTTCC AGG AAG CAA TCC AAT CGG GAG TCA GCC 
GAAGAGCTAT ATTACGATCA AAACAACTGG ACAAGG TCC TTC GTT AGG TTA GCC CTC AGT CGG 

RKQSNRESA> 
d d d EXON4 d d d > 
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4240 4250 4260 4270 4280 4290 

AGG CGC TCA AGA AGO AGA AAG GCA GOT CAC TTG AAT GAG CTG GAG GCA CAG GTGTGA 
TCC GCG AGT TCT TCG TCT TTC CGT CGA GTG AAC TTA CTC GAC CTC CGT GTC CACACT 
RRSRSRKAAHLNELEAQ> 



_d EX0N4_ 



4300 4310 4320 4330 4340 4350 4350 

TAGTTCACAT AGTTATTTTC GATAAGACAT AAAATCCTAA ATTACTGGCT ACTGACTTCA GTTATGGATT 
ATCAAGTGTA TCAATAAAAG CTATTCTGTA TTTTAGGATT TAATGACCGA TGACTGAAGT CAATACCTAA 

4370 4380 4390 4400 4410 4420 
TACTTGTTAC AG GTA TCG CAA TTA AGA GTC GAG AAC TCC TCG CTG TTA AGG CGT CTT GCT 
ATGAACAATG TC CAT AGC GTT AAT TCT CAG CTC TTG AGG AGC GAC AAT TCC GCA GAA CGA 
VSQLRVENSSLLRRLA> 
e G e e e e EXON5 e e e e e e e > 



4430 4440 4450 4460 4470 

GAT GTT AAC CAG AAG TAG AAT GAT GCT GCT GTT GAC AAT AGA GTG CTA AAA GCA GAT 
CTA CAA TTG GTC TTC ATG TTA CTA CGA CGA CAA CTG TTA TCT CAC GAT TTT CGT CTA 



_EX0N5_ 



4480 4490 4500 4510 4520 4530 4540 

GTT GAG AGO TTG AGA GCA AAG GT ATGCTATATA TGCCTTTTGC AATATGCATC CCATGGATTG 
CAA CTC TGG AAC TCT CGT TTC CA TACGATATAT ACGGAAAACG TTATACGTAG GGTACCTAAC 

V E T L R A K> 
e e EXON5 e e > 

4550 4560 4570 4580 4590 4600 4610 

CTACTTTGGC TTGTTTCAAA CTTTCAACGT GACTTGTGTA CCCTGTTATT AGAAGAATAA TCCCGCCTAC 
GATGAAACCG AACAAAGTTT GAAAGTTGCA CTGAACACAT GGGACAATAA TCTTCTTATT AGGGCGGATG 

4620 4630 4640 4650 4660 4670 4680 

CATTATACTC TATAAATCAC CATTTGGCCA GTCCAAACAT GATTATTAAA TCAGGTCAAT CTGAACATTG 
GTAATATGAG ATATTTAGTG GTAAACCGGT CAGGTTTGTA CTAATAATTT AGTCCAGTTA GACTTGTAAC 

4690 4700 4710 4720 4730 4740 
AAATGTATCA AAAATTCGCA GGTG AAG ATG GCA GAG GAC TCG GTG AAG CGG GTG ACA GGC 
TTTACATAGT TTTTAAGCGT CCAC TTC TAG CGT CTC CTG AGC CAC TTC GCC CAC TGT CCG 
KMAEDSVKRVTG> 
f f f f EX0N6 f f f f f > 

4750 4760 4770 4780 4790 
ATG AAC GCG TTG TTT CCC GCC GCT TCT GAT ATG TCA TCC CTC AGC ATG CCA TTC AAC 
TAC TTG CGC AAC AAA GGG CGG CGA AGA CTA TAG AGT AGG GAG TCG TAG GGT AAG TTG 
MNALFPAASDMSSLSMPFN> 
f f f f f f f f EX0N6 f f f f f f f f > 
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4800 4810 4820 4830 4840 4850 

AGC TCC CCA TCT GAA GCA ACQ TCA GAC GCT GCT GTT CCC ATC CAA GAT GAC CCG AAC 
TCG AGO GGT AGA CTT CGT TGC AGT CTG CGA CGA CAA GGG TAG GTT CTA CTG GGC TTG 



4860 4870 4880 4890 4900 4910 
AAT TAC TTC GCT ACT AAC AAC GAC ATC GGA GGT AAC AAC AAC TAC ATG CCC GAC ATA 
TTA ATG AAG CGA TGA TTG TTG CTG TAG CCT CCA TTG TTG TTG ATG TAC GGG CTG TAT 
NYFATNNDIGGNNNYMPDI> 
f f f f f f f f EX0N6 f f f f f f f f > 



4920 4930 4940 4950 4960 

CCT TCT TCG GCT CAG GAG GAC GAG GAC TTC GTC AAT GGC GCT CTG GCT GCC GGC AAG 
GGA AGA AGC CGA GTC CTC CTG CTC CTG AAG CAG TTA CCG CGA GAC CGA CGG CCG TTC 
PSSAQEDEDFVNGALAAGK> 

f f f f f f f f EX0N6 f f f f f f f > 

4970 4980 4990 5000 5010 

5020 

ATT GGC CGG CCA GCC TCG CTG CAG CGG GTG GCG AGC CTG GAG CAT CTC CAG AAG AGG 
TAA CCG GCC GGT CGG AGC GAC GTC GCC CAC CGC TCG GAC CTC GTA GAG GTC TTC TCC 
IGRPASLQRVASLEHLQKR> 

f f f f f f f f EXON6 f f f f f f f f > 

5030 5040 5050 5060 5070 5080 

ATG TGC GGT GGG CCG GCT TCG TCT GGG TCG ACG TCC TGA GACCGA AACCCAGAGC 
TAC ACG CCA CCC GGC CGA AGC AGA CCC AGC TGC AGG ACT CTGGCT TTGGGTCTCG 

MCGGPASSGSTS*> 
f f f f f EX0N6 f f f f f > 

5090 5100 5110 5120 5130 5140 5150 

TGCTTCGGTT CTGAAAGACA CTGCGAGCAG GAAATGATGA TTGGACAGGC GTAGACATTG CTAATGCTGT 
ACGAAGCCAA GACTTTCTGT GACGCTCGTC CTTTACTACT AACCTGTCCG CATCTGTAAC GATTACGACA 

5160 5170 5180 5190 5200 5210 5220 

GAGGTTGATG ATTGTTGGTC GTCGTCGTCG TCATTGTGCA TTCTTTGTAA GGGACACCTC TTAGTACCCT 
CTCCAACTAC TAACAACCAG CAGCAGCAGC AGTAACACGT AAGAAACATT CCCTGTGGAG AATCATGGGA 

5230 5240 5250 5260 5270 5280 5290 

CTTCTTCTAA GGGACTTAGT ACCCCTTGTG GATCTCATCG TCCTAAATAC TATACACATT AGCCAAATGT 
GAAGAAGATT CCCTGAATCA TGGGGAACAC CTAGAGTAGC AGGATTTATG ATATGTGTAA TCGGTTTACA 



> terminator 



5300 5310 5320 5330 5340 5350 5360 

TCATTGGTGT GATGGCGTCG TCCCTAATTT GAACGACTGA TTTCAGGCAG CTGCTATGCT ATCATTCAAT 
AGTAACCACA CTACCGCAGC AGGGATTAAA CTTGCTGACT AAAGTCCGTC GACGATACGA TAGTAAGTTA 
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